A-68556 



SIMULTANEOUS SCREENING AND IDENTIFICATION OF SEQUENCE ALTERATIONS 
FROM AMPLIFIED TARGET 

5 FIELD OF THE INVENTION 

The invention relates to methods for screening for and identifying sequence alterations in 
nucleic acids. 

10 

BACKGROUND OF THE INVENTION 

The ability to scan and identify sequence alterations has widespread applications in many 
areas including genetics, immunology, infectious disease, oncology, epidemiology and 
15 forensics. Mutations leading to cancer can arise in a number of different genes and in 

different positions within the same gene. Gene variants also can be the source of hereditary 
diseases. Scanning and identification of such nucleic acid alterations have important 
implications for diagnosis and prognosis and for guiding therapy. 

20 Several methods exist for scanning for the presence or absence of sequence variants. 

Denaturing Gradient Gel electrophoresis (DGGE) takes advantage of differences in the 
melting location of slightly different nucleotide sequences on a gradient gel during 
electrophoresis. (Mers et al., Methods Enzymol. 155:501-527 (1987); Abrams et al., 
Genomics 7:463-475 (1990); Vijg et al., U.S. Patent No. 5,814,491) The process is generally 
25 performed on fragments of a nucleic acid of interest produced through polymerase chain 

reaction (PCR). Similarly, differences in mobility during electrophoresis of single strands of 
nucleic acid is used in the technique known as Single-Strand Conformation Polymorphism 
(SSCP). (Orita et al., Genomics 5:874-879 (1989)) This method takes advantage of different 
conformations assumed by nucleic acids having slightly different sequences under non- 
30 denaturing conditions. But, while these methods can identify nucleic acid sequences with 

alterations, the nature and exact location of the alteration(s) must be subsequently determined 
by other techniques. 

Identification of the location and nature of an alteration in a nucleic acid sequence may be 
35 determined by direct sequencing. However, this process is generally very labor intensive and 

time-consuming. Other means of detecting known sequence alterations involve the use of 
oligonucleotides which hybridize to the specific altered sequence. (Conner et al., Proc. Nat. 
Acad. Sci., 80:278-282 (1983)) This method becomes untenable, however, when an 



alteration is unknown. Other methods utilizing restriction enzymes have been developed to 
identify alterations in a sequence without directly sequencing the nucleic acid. Unfortunately, 
these enzymes recognize only a limited number of restriction sequences. And none of these 
methods provides for quick determination of which alteration is actually present. 

5 

It is apparent that there is a need for a fast and simple method of screening nucleic acids for 
alterations and determining the precise location and nature of such an alteration. 

Accordingly, it is an object of the present invention to provide novel methods of screening for 
10 sequence alterations in a target nucleic acid. The invention provides methods utilizing probes 

which hybridize to nucleic acid to detect alterations in a target sequence as compared to a 
control. 

It is a further object of the invention to provide methods of screening for and determining the 
15 location of a sequence alteration in a target nucleic acid in a single step. These methods 

utilize probes which hybridize to nucleic acid to detect the existence and location of alterations 
in a target sequence. 

An additional object of the invention is to provide methods of screening, determining the 
20 location and determining the nature of a sequence alteration in a target nucleic acid in a single 

step. These methods provide for the identification of specific base changes in the target 
sequence. 

25 SUMMARY OF THE INVENTION 

In accordance with the objectives outlined above, the present invention provides methods of 
screening and identifying alterations in a target nucleic acid sequence as compared with a 
control nucleic acid. Probes are produced which are complementary to and, therefore, 
30 hybridize to overlapping regions of a control nucleic acid. The methods are based on the fact 

that probes directed to a control sequence denature from a target sequence at a different 
temperature than from the control when the target sequence has an alteration (such as a point 
mutation) as compared with the control at the location at which the probe hybridizes. 

35 In one aspect of the present invention, a method is provided for identifying a sequence 

alteration in a target nucleic acid as compared to a control nucleic acid. The method entails 
hybridizing a plurality of nucleic acid probes with said target nucleic acid, wherein said probes 
are complementary to different overlapping regions of said control nucleic acid. The melting 
temperature (TJ of at least two overlapping probes is determined, as well as the difference 
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between the melting temperature of each probe from the target nucleic acid and the control 
nucleic acid (A T m ). The difference between the A T m s of overlapping probes is determined 
as an indication of whether or not a sequence alteration exists in the target nucleic acid as 
compared to the control nucleic acid. In one embodiment, as few as two probes are required 
5 to examine a specific sequence alteration in target nucleic acid as compared with control 

nucleic acid. 

In the method described above, the difference in A T m between at least two overlapping 
probes indicates the location of a nucleotide difference in the target nucleic acid as compared 
10 to the control nucleic acid. In addition, the difference in A T m between at least two overlapping 

probes indicates a substitution in the target nucleic acid sequence as compared to the control 
nucleic acid. Furthermore, the difference in A T m between at least two overlapping probes 
indicates the type of nucleotide substituted in the target nucleic acid sequence as compared to 
the control nucleic acid. 

15 

In another aspect of the invention, provided herein is a method for identifying a sequence 
alteration in a target nucleic acid as compared to a control nucleic acid. The method involves 
hybridizing a plurality of nucleic acid probes with the target nucleic acid; a first set of probes is 
complementary to regions of the control nucleic acid separated by one or more nucleotides 

20 and at least a second set of probes is complementary to regions of the control separated by 

one or more nucleotides. The regions complementary to the second set of probes include the 
nucleic acids separating the first set of probes and are overlapping with the regions 
complementary to the first set of probes. The method further entails determining the melting 
temperature (TJ of at least two overlapping probes from the target nucleic acid, determining 

25 for these at least two overlapping probes the difference between the T m from the target nucleic 

acid and the T m from the control nucleic acid (A TJ, and determining the difference in 
determined A T m between overlapping probes. The difference A T m between overlapping 
probes provides an indication of the presence or absence of a sequence alteration in the 
target nucleic acid as compared to the control nucleic acid. Preferably only two sets of probes 

30 are used. 

Further to the method just described, the difference in A T m between at least two overlapping 
probes indicates the location of a nucleotide difference in the target nucleic acid as compared 
to the control nucleic acid. In addition, the difference in A T m between at least two overlapping 
35 probes indicates a substitution in the target nucleic acid sequence as compared to the control 

nucleic acid. Furthermore, the difference in A T m between at least two overlapping probes 
indicates the type of nucleotide substituted in the target nucleic acid sequence as compared to 
the control nucleic acid. 
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Also with respect to the method just described, in one embodiment only three probes are 
used, two from the first set which are complementary to adjacent regions and one from the 
second probe set which overlaps each of the probes from the first probe set. In this latest 
case, when the probes from the first probe set have a A T m of zero and the probe from the 
second probe set has a A T m other than zero, an alteration in the target is indicated as being 
in the nucleotides of the control nucleic acid sequence separating the regions to which the 
probes from the first probe set are complementary. In the case that one nucleotide separates 
adjacent regions to which the probes of the first probe set are complementary, a A T m of zero 
for two adjacent (non-overlapping) probes and a A T m other than zero for a probe which 
overlaps both adjacent probes indicated that the alteration is at the nucleotide separating the 
adjacent regions. In this last case, the A T m of the overlapping probe indicates both a 
substitution as the sequence alteration and the type of substitution. 



DESCRIPTION OF THE DRAWINGS 

Figure 1 depicts schematically how overlapping probes are walked along a complementary 
single-stranded target such that each probe shares a common sequence with a probe placed 
before and after it. This figure also shows actual experimental data illustrating the relative 
destabilization of overlapping probes in response to single nucleotide alterations in the target 
at different locations in the overlapping region. 

Figure 2 depicts an alternative embodiment of the present invention in which a set of probes 
are walked along the complementary target, separated by a single base between flanking 
probes, and a second set of probes is similarly walked along the target, separated by a single 
base and centered on the bases separating the first set of probes. This figure also presents 
predicted changes in melting temperature of adjacent and overlapping probes in response to 
a single base alteration in the target at different locations. 

Figure 3 shows the effects on the melting temperature of a 15mer oligonucleotide probe from 
an otherwise complementary target in response to different nucleotide pairings at the center 
position of the probe. A Unique melting temperature is seen for each pairing. 

Figure 4 shows the effects on the melting temperature of a 20mer probe from an otherwise 
complementary target in response to a single G:T mismatch at different locations along the 
length of the probe. A unique melting temperature is seen for each location. 

Figure 5 shows A T m for G:T mismatches at 20 different positions within the hybridization 
region of a 20mer probe to a target. This figure illustrates the increasing destabilization of a 
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duplex as a G:T mismatch is moved from the end position of the probe to the center. At each 
position, T m was determined for a completely complementary duplex and the same duplex 
with a G:T mismatch. 

5 Figure 6 shows how the position of a mismatch can be localized using the A T m 's of 2 

overlapping probes to determine A A T m . The two probes had an overlapping hybridization 
region of 10 nucleotides on the targets. When the mismatch is positioned at a similar 
distance from the end of each probe, the A A T m is at or near zero. In comparison, if the 
mismatch is positioned near the end of one probe and the center of another, the A A T m is at 
1 0 its greatest. The direction of the A A T m sign (i.e., + or -) indicates at which end of the 

overlapping region the mismatch is positioned. 

DETAILED DESCRIPTION OF THE INVENTION 

1 5 The present invention provides methods of screening for and identifying sequence alterations 

in a target nucleic acid. These methods utilize probes which are complementary to a control 
nucleic acid. The temperature of thermal denaturation (melting) of these probes when 
hybridized with the target is compared with the melting temperatures of the same probes from 
the control. Surprisingly, determination of melting temperature of the individual probes from 

20 the target and the control provide sufficient information to determine not only the location of an 

alteration, but the nature of a nucleotide substitution. A difference in melting temperature of a 
probe detects an alteration and provides information about the nature of the alteration. 
Differences in melting temperatures of overlapping probes provides information to localize the 
alteration. Advantageously, the screening and identification of the sequence alteration can be 

25 obtained in a single thermal denaturation step. 

It is contemplated that the present invention may be applied to virtually any nucleic acid for 
which the sequence is known. The methods are particularly useful for identifying single 
nucleotide substitutions in a target. 

30 

By "melting temperature" (T m ) is meant the temperature at which a probe dissociates from a 
nucleic acid to which it is hybridized under defined conditions. The conditions are preferably 
chosen so as to provide the maximum change in T m if a mismatch is present. Examples of 
preferred hybridization conditions are conditions used for asymmetric amplification by PCR, so 
35 that no sample handling is necessary. The same solution can then be amplified and melted in 

the same instrument to determine T m values. An example of asymmetric amplification and 
melting analysis is found in Lay and Wittwer, Clin. Chem. 43:2262-2267 (1997), which is 
incorporated herein in its entirety. In one embodiment of the present invention, asymmetric 
amplification is used to produce an excess of one strand of a target or control, so that the 
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overproduced product hybridizes to the probes rather than to its complementary product 
strand. 



In another embodiment of the invention, single strand production and melting analysis is 
5 performed as separate steps. Single strand nucleic acid may be produced by asymmetric 

PCR or other techniques known in the art. Examples of such techniques include self 
sustained sequence replication (3SR) and strand displacement amplification (SDA). The 
single strand target or control is then combined with the probes and T m values are determined. 
In this embodiment, any probe hybridization solution may be used, such solutions being well 
10 known in the art. 

By "probe" is meant any sequence of molecules which is complementary to the control. By 
"complementary" is meant that the probe has a specific sequence of molecules which 
specifically hybridizes on a residue-by-residue basis with a sequence of nucleic acid. Such 
15 molecules include nucleic acids like DNA or RNA, and probes can contain both deoxy- and 

ribonucleic acids (oligonucleotides). The probes may have a ribose-phosphate backbone 
typical of naturally occurring DNA or RNA, but may also contain modifications in such a 
ribose-phosphate backbone. Probes may also be in other forms, for example as peptide 
nucleic acids. 

20 

Means for producing probes having a desired sequence are well known in the art. Preferably, 
each probe has the same number of nucleotides. Preferably, each probe is between 10 and 
50 nucleic acids in length, more preferably between 10 and 30 nucleic acids, still more 
preferably, from 15 to 20 nucleic acids. In one embodiment, each probe is a 15-base nucleic 
25 acid. In another embodiment, each probe is a 20-base nucleic acid. 

In one embodiment of the invention, the probes are completely overlapping. By "overlapping" 
is meant that at least two probes hybridize with the same one or more nucleotides of the 
control nucleic acid. This embodiment will be referred to as the complete overlapping 
30 embodiment. In a preferred embodiment, any one nucleotide of the control is in the 

hybridizing region of exactly two probes. 

In another embodiment, a set of probes is complementary to regions of the control nucleic 
acid such that at least one nucleotide of the control separates each adjacent region, therefore 
35 a spacing of at least one nucleotide separates each adjacent probe in the set when 

hybridized. In this embodiment, at least a second set of probes similarly is complementary to 
regions of the control such that at least one nucleotide separates each adjacent region, each 
probe of the second being complementary to a region of the control containing the one or 
more nucleotides which separate adjacent probes of the first set. This will be referred to as 
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the nucleotide spacing embodiment. In a preferred embodiment, a single nucleotide 
separates each adjacent region in each probe set. 

Any means of determining the melting temperature of a probe may be used. Several means of 
determining the melting temperature of hybridized nucleotides are known in the art. 

In a preferred embodiment, fluorescent double-stranded DNA specific dyes are used to 
determine the melting temperature. In this embodiment, the fluorescent output of a sample of 
probe hybridized with target or control is monitored as the sample is heated. The fluorescent 
output is plotted against the temperature to determine the melting temperature of the probe 
from the target or control. 

In another preferred embodiment, probes comprising a fluorescent label are used. For 
example, the hybridization of 5'-fluorescein-labeled probes, and melting thereof, can be 
observed when the fluorescein label is in close proximity to a guanosine residue on the 
complementary strand, due to the inherent quenching of fluoresceine signal by guanosine. 
(See, Crocket and Wittwer, Anal. Biochem., in press (2001), incorporated herein in its 
entirety). 

A difference in melting temperature of a probe from the target as compared with the melting 
temperature of an identical probe from the control (A TJ indicates an alteration in the target 
sequence as compared to the control. A A T m of a second probe localizes the alteration to 
the region of the target in which the two probes overlap. The difference between A T m of two 
overlapping probes ( A A T m ) indicates the exact position of the base change in the 
overlapping region because this number is unique for each residue location for each type of 
residue change, as further described below. 

The magnitude of A T m for a given probe also indicates the nature of a base change which is 
present. That is, the type of residue (e.g., adenine, guanosine, thymine, cytosine) that is 
different in the target as compared with the control may be determined. In a preferred 
embodiment, determination of the residue change in a target as compared to a control is 
made using a T m profile map, as further described below. A "T m profile map" is a table 
describing the anticipated A T m for each possible residue change under consideration for each 
residue with which a probe hybridizes. The T m profile map may be generated from 
experimental data for specific sequence alterations in a control sequence or from predicted 
T m 's using thermodynamic calculations (See, Santa Luci et al., Proc. Nat. Acad. Sci. USA 
95:1460-1465 (1998) and Schutz et al., BioTechniques 27:1218-1224 (1999)). 



In a preferred embodiment, determination of the residue change in a target as compared to a 
control is made using A T m of two or more probes. Example 3 further illustrates 
determination of the location and type of a single nucleotide change in a target as compared 
to a control. 

5 

In a preferred embodiment, the rate at which the temperature is increased during the 
determination of a melting temperature for a probe is the same for all probes and for each 
probe in determining the melting temperature from the target and from the control. In an 
alternate embodiment, the melting temperature for a probe of interest from a given target is 
10 determined at two or more rates of increasing temperature. 

In a preferred embodiment, melting temperatures of a probe from one or more targets having 
known alterations are determined to produce a T m -alteration profile or A T m map for the probe. 
A "T m profile map" is a table describing the anticipated A T m for each possible residue 

15 change under consideration for each residue with which a probe hybridizes. The T m may be 

determined for alterations at different locations, as well as alterations involving different 
nucleotide substitutions. Additionally, the T m for each of two or more different rates of 
increasing temperature may be determined for one or more alterations, to produce a T m - 
alteration-rate profile. The T m profile map may also be generated from predicted T m 's using 

20 thermodynamic calculations, as is well known in the art. {See, Santa Luci et al., Proc. Nat. 

Acad. Sci. USA 95:1460-1465 (1998); Schutz et al., BioTechniques 27:1218-1224 (1999); and 
Peyret et al., B/oc/7em/sf/y38(12):3468-3477 (1999), each of which is incorporated herein in 
its entirety). 

25 The differences in A T m between the two probes is used to determine the location within the 

overlap region of the two probes. Alternatively, the A T m for each probe may be determined at 
two or more rates of increasing temperature. 

In the single nucleotide spacing embodiment using two sets of probes, usually two probes will 
30 hybridize to the region having any single base mutation and each of said probes will have a A 

T m . However, in the case where a single base change occurs in the gap between the 
hybridization region of two probes of one probe set, only one probe of the other probe set will 
have a A T m . Otherwise, when the A T m of a first probe is subtracted from the A T m of a 
second overlapping probe, the difference in A T m between the overlapping probes (A A TJ 
35 will increase for each alteration that is closer to the end of the first probe. Alternatively, the 

location of the alteration may be determined from the A T m -alteration-rate profile for either or 
both of the first and second probes. 
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In addition to the location of the alteration, the nature of a nucleotide substitution may be 
determined as described above. In one embodiment, a T m -alteration profile for one or both 
overlapping probes with a A T m is used to determine the substitution. 

5 In one embodiment, the invention is implemented in a nucleotide solid support array. Such 

arrays are well known in the art. An example of such an array is described in U.S. Patent No. 
5,571 ,639, incorporated herein in its entirety. 

In a preferred embodiment, the determination of the T m of the probes for a given target and 
10 control is done simultaneously. 

In a preferred embodiment, the invention is automated. In this embodiment specialized 
hardware and software programs may be applied to control the automation of the invention. 
One system useful in the present invention is the commercially available LightCycler™, made 

15 by Idaho Technology (Wittwer et al., BioTechniques 22:176-181 (1997), incorporated herein in 

its entirety). This apparatus is capable of rapid thermal cycling necessary for amplification of 
nucleic acid for procedures such as PCR. This apparatus is also capable of providing a 
probe/control and/or probe/target sample with a prescribed temperature within a very broad 
range and change temperatures at a prescribed rate. The LightCycler™ is also capable of 

20 measuring fluorescence in a sample and changes in fluorescence coincidentally with changing 

temperature for T m determinations. 

The present invention finds uses in many fields, including but not limited to genetics, 
immunology, infectious disease, oncology, epidemiology and forensics. Such uses include, 
25 but are not limited to, identifying mutations in tumorigenic material, identifying inheritable 

genetic disease and guiding treatment for such diseases. Additionally, the invention may be 
used for identifying allelic variants, identifying sources of biological samples and determining 
paternity. 

30 It will be apparent to one of ordinary skill in the art that many other applications of the present 

invention are possible. All references cited herein are incorporated in their entirety. 

The following examples are provided for illustrative purposes. It is understood that these 
examples in no way serve to limit the true scope of the invention. 

35 
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EXAMPLES 



Example 1 

5 Asymmetric PCR is performed by rapid cycling techniques in a reaction volume of 10 ul with 

dNTP's at 200 uM each, one primer at 0.5 uM, the other primer at 0.05 uM, 3 mM Mg 2+ , 50 
mM Tris (pH 8.3), 500 mg/ml BSA. 0.4 U Taq polymerase, and 10 ng of genomic DNA. After 
50 cycles of 94° C for 0 seconds, 55° C for 0 seconds and 72° C for 10 seconds, the sample 
is cooled to 40° C and a high resolution melting curve is performed by heating at 0.05° C per 

10 second in the presence of fluorescent double-stranded DNA-specific dye. 

Melting curves are acquired for all probes and the first derivative of each curve is calculated 
and plotted as -dF/dT (the negative first derivative of fluorescence with respect to 
temperature). T m s are determined as the peak of each derivative curve by Gaussian fitting. 
15 A T m 's (the difference in T m between control and test samples) and AA T m s (the difference in 

A T m when using different probes) are calculated and used to identify sequence alterations in 
the test sample as compared to the control sample. 

Example 2 

20 

Single stranded nucleic acid of a control and a target are separately produced by asymmetric 
PCR, SDA or 3SR. After amplification with a biotinylated primer, the amplified control and 
target are purified by attachment to streptavidin-coated magnetic beads. The purified single 
strand nucleic acid is placed in probe wells or a probe chip and analyzed by melting as in 
25 Example 1 . 

Example 3 

Single nucleotide polymorphism (SNP) screening and identification. 

30 

Two sets of 15-mer nucleic acid probes are produced for use in the nucleotide spacing 
embodiment described above, wherein the number of nucleotides separating adjacent 
complementary regions for each probe set is one (see Fig. 2). If a A T m is observed for only 
one probe then the SNP is localized to the specific base that is unique for that probe. The 
35 magnitude of A T m identifies the specific base change (see Fig. 3). If a A T m is observed for 

two probes, then the SNP is localized to the base positions where the overlap between probes 
occurs. The specific position and base change is identified from a Tm alteration profile map 
that is either measured experimentally or predicted from thermodynamic calculations 
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(SantaLucia, Proc. Natl. Acad. Sci. USA, 95:1460-1465 (1998); Schutz et al., BioTechniques, 
27:1218-1224 (1999)). 



An example of a partial T m alteration profile for the overlapping portion of 2 probes is shown in 
5 Table 1 . There are seven possible positions that overlap between these 1 5-mer probes. The 

partial T m alteration map of Table 1 shows ail possibilities for SNP changes for an A in the 
control DNA (a complete map would contain 3 other Tables of equal size for C, G, and T, but 
the principle is the same). The values of A T m (1 ), A T m (2), and A A T m in the Table are used 
to identify the position and precise SNP that is present. 

10 

| Table 1 . Sequence Map for base A in the control sequence ] 
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With high-resolution determination of the T m s, every possible SNP (there are 84 possibilities 
in the 7 positions, 21 are shown in Table 1) can be identified by a unique value of A T m (1), A 
T m (2), and A A T m . Seldom is such precision required, and often the need is to distinguish 

30 between several sequence possibilities. In this case, a limited sequence map can be 

construction with only the sequence alterations that need to be screened. In this case, lower 
resolution equipment can be used to discriminate between the possibilities (not necessarily 
limited to SNPs). Another method to increase discrimination is to use different melting 
temperature rates (for example 0.1 °C/sec and 0.5°C/sec to estimate T m (Ririe et al., Anal. 

35 Biochem. 245:154-160 (1997), incorporated herein in its entirety). Alternately, both heating 

and cooling curves can be obtained to estimate T m by melting and annealing (Gundry et al., 
Genetic Testing 3:365-370 (1999), incorporated herein in its entirety). When 2 different 
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apparent T m s are measured for each probe/target combination, the matrix will approximately 
double in size, providing twice as many parameters for distinguishing each possible 
genotype. 
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